Unsupervised Analysis of the Voynich Manuscript
نویسنده
چکیده
The aim of this project is to research the possibilities of applying unsupervised learning techniques for natural language and other sequential data to undeciphered texts and manuscripts. The undeciphered text used is the Voynich Manuscript, a mysterious book from the 15th or 16th century that is written in an unknown script. Some methods that could be applied to manuscripts such as these will be discussed. Furthermore, the results of applying some of these techniques to the text of the manuscript will be discussed.
منابع مشابه
How the Voynich Manuscript was created
The Voynich manuscript is a medieval book written in an unknown script. This paper studies the relation between similarly spelled words in the Voynich manuscript. By means of a detailed analysis of similar spelled words it was possible to reveal the text generation method used for the Voynich manuscript.
متن کاملAnalysis of Letter Frequency Distribution in the Voynich Manuscript
The Voynich manuscript is one of the biggest mysteries in linguistic science. Although a lot of researches are being made, the author, the origin and the content of the manuscript still remain unknown. In this work letter frequency distributions of about 300 languages were compared to one of the language in the Voynich manuscript. The study shows the most similar languages according to this cha...
متن کاملCo-Occurrence Patterns in the Voynich Manuscript
The Voynich Manuscript is a medieval book written in an unknown script. This paper studies the distribution of similarly spelled words in the Voynich Manuscript. It shows that the distribution of words within the manuscript is not compatible with natural languages.
متن کاملStatistical Analysis of Unknown Written Language: The Voynich Manuscript
The Voynich Manuscript is a document written in an unknown language or cipher. This research proposal presents an idea into determining possible relationships within the Voynich. This is to be performed through known statistical methods relating to linguistics. The document reviews previous research carried out by other researchers. The proposed method is given and shows the current results obt...
متن کاملStatistical Properties of European Languages and Voynich Manuscript Analysis
The statistical properties of letters frequencies in European literature texts are investigated. The determination of logarithmic dependence of letters sequence for one-language and twolanguage texts are examined. The pare of languages is suggested for Voynich Manuscript. The internal structure of Manuscript is considered. The spectral portraits of two-letters distribution are constructed.
متن کامل